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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CCGCCGATGG CGCTGAGGCG GAGCATGGGG CGGCCGGGGC TCCCGCCGCT 50 

GCCGCTGCCG CCGCCACCGC GGCTCGGGCT GCTGCTGGCG GCTCTGGCTT 100 

CTCTGCTGCT CCCGGAGTCC GCCGCCGCAG GTCTGAAGCT CATGGGAGCC 150 

CCGGTGAAGC TGACAGTGTC TCAGGGGCAG CCGGTGAAGC TCAACTGCAG 200 

TGTGGAGGGG ATGGAGGAGC CTGACATCCA GTGGGTGAAG GATGGGGCTG 250 

TGGTCCAGAA CTTGGACCAG TTGTACATCC CAGTCAGCGA GCAGCACTGG 3 00 

ATCGGCTTCC TCAGCCTGAA GTCAGTGGAG CGCTCTGACG CCGGCCGGTA 350 

CTGGTGCCAG GTGGAGGATG GGGGTGAAAC CGAGATCTCC CAGCCAGTGT 400 

GGCTCACGGT AGAAGGTGTG CCATTTTTCA CAGTGGAGCC AAAAGATCTG 4 50 

GCAGTGCCAC CCAATGCCCC TTTCCAACTG TCTTGTGAGG CTGTGGGTCC 500 

CCCTGAACCT GTTACCATTG TCTGGTGGAG AGGAACTACG AAGATCGGGG 550 

GACCCGCTCC CTCTCCATCT GTTTTAAATG TAACAGGGGT GACCCAGAGC 600 

ACCATGTTTT CCTGTGAAGC TCACAACCTA AAAGGCCTGG CCTCTTCTCG 650 

CACAGCCACT GTTCACCTTC AAGCACTGCC TGCAGCCCCC TTCAACATCA 700 

CCGTGACAAA GCTTTCCAGC AGCAACGCTA GTGTGGCCTG GATGCCAGGT 750 

GCTGATGGCC GAGCTCTGCT ACAGTCCTGT ACAGTTCAGG TGACACAGGC 800 
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CCCAGGAGGC TGGGAAGTCC TGGCTGTTGT GGTCCCTGTG CCCCCCTTTA 850 
CCTGCCTGCT CCGGGACCTG GTGCCTGCCA CCAACTACAG CCTCAGGGTG 900 
CGCTGTGCCA ATGCCTTGGG GCCCTCTCCC TATGCTGACT GGGTGCCCTT 950 
TCAGACCAAG GGTCTAGCCC CAGCCAGCGC TCCCCAAAAC CTCCATGCCA 1000 
TCCGCACAGA TTCAGGCCTC ATCTTGGAGT GGGAAGAAGT GATCCCCGAG 1050 
GCCCCTTTGG AAGGCCCCCT GGGACCCTAC AAACTGTCCT GGGTTCAAGA 1100 
CAATGGAACC CAGGATGAGC TGACAGTGGA GGGGACCAGG GCCAATTTGA 1150 
CAGGCTGGGA TCCCCAAAAG GACCTGATCG TACGTGTGTG CGTCTCCAAT 1200 
GCAGTTGGCT GTGGACCCTG GAGTCAGCCA CTGGTGGTCT CTTCTCATGA 1250 
CCGTGCAGGC CAGCAGGGCC CTCCTCACAG CCGCACATCC TGGGTACCTG 1300 
TGGTCCTTGG TGTGCTAACG GCCCTGGTGA CGGCTGCTGC CCTGGCCCTC 1350 
ATCCTGCTTC GAAAGAGACG GAAAGAGACG CGGTTTGGGC AAGCCTTTGA 1400 
CAGTGTCATG GCCCGGGGAG AGCCAGCCGT TCACTTCCGG GCAGCCCGGT 1450 
CCTTCAATCG AGAAAGGCCC GAGCGCATCG AGGCCACATT GGACAGCTTG 1500 
GGCATCAGCG ATGAACTAAA GGAAAAACTG GAGGATGTGC TCATCCCAGA 1550 
GCAGCAGTTC ACCCTGGGCC GGATGTTGGG CAAAGGAGAG TTTGGTTCAG 1600 
TGCGGGAGGC CCAGCTGAAG CAAGAGGATG GCTCCTTTGT GAAAGTGGCT 1650 
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GTGAAGATGC TGAAAGCTGA CATCATTGCC TCAAGCGACA TTGAAGAGTT 1700 

CCTCAGGGAA GCAGCTTGCA TGAAGGAGTT TGACCATCCA CACGTGGCCA 1750 

AACTTGTTGG GGTAAGCCTC CGGAGGAGGG CTAAAGGCCG TCTCCCCATC 1800 

CCCATGGTCA TCTTGCCCTT CATGAAGCAT GGGGACCTGC ATGCCTTCCT 1850 

GCTCGCCTCC CGGATTGGGG AGAACCCCTT TAACCTACCC CTCCAGACCC 1900 

TGATCCGGTT CATGGTGGAC ATTGCCTGCG GCATGGAGTA CCTGAGCTCT 1950 

CGGAACTTCA TCCACCGAGA CCTGGCTGCT CGGAATTGCA TGCTGGCAGA 2000 

GGACATGACA GTGTGTGTGG CTGACTTCGG ACTCTCCCGG AAGATCTACA 2050 

GTGGGGACTA CTATCGTCAA GGCTGTGCCT CCAAACTGCC TGTCAAGTGG 2100 

CTGGCCCTGG AGAGCCTGGC CGACAACCTG TATACTGTGC AGAGTGACGT 2150 

GTGGGCGTTC GGGGTGACCA TGTGGGAGAT CATGACACGT GGGCAGACGC 2200 

CATATGCTGG CATCGAAAAC GCTGAGATTT ACAACTACCT CATTGGCGGG 2250 

AACCGCCTGA AACAGCCTCC GGAGTGTATG GAGGACGTGT ATGATCTCAT 2300 

GTACCAGTGC TGGAGTGCTG ACCCCAAGCA GCGCCCGAGC TTTACTTGTC 2350 

TGCGAATGGA ACTGGAGAAC ATCTTGGGCC AGCTGTCTGT GCTATCTGCC 2400 

AGCCAGGACC CCTTATACAT CAACATCGAG AGAGCTGAGG AGCCCACTGC 2450 

GGGAGGCAGC CTGGAGCTAC CTGGCAGGGA TCAGCCCTAC AGTGGGGCTG 2500 

GGGATGGCAG TGGCATGGGG GCAGTGGGTG GCACTCCCAG TGACTGTCGG 2550 
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TACATACTCA CCCCCGGAGG GCTGGCTGAG 
CCAGCCAGAG AGTCCCCTCA ATGAGACACA 
AAGGGCTACT GCCACACAGT AGCTGTTAGC 
GGCCATTTGG CCGGCTCTGG TGGCCACTGA 
CTGACCCCAG CCCAGACAGC AAGGTGTGGA 
CAAGCTGTGC TGGGAAG CCC GGACTGACCA 
TCCTGCAACC ACTCTGTGGC CAGCCTGGCA 
ATGGAAGTGG GCCAGTCCTG GTTGTCTGAA 
GGGGTGGTTA TGTTTCCATG GTTACCATGG 
AGGGCAGGTC CAGCTCTGTG GGCCCTACCC 
TGCTTAAGTG CATGCATTGA GCTGCCTCCA 
CCACACTTGG GGTTTAAATA TCCAGGTGTG 
ATGTCCTTGT AATATTCCCT TTTAGGTGAG 
CTCAGGTCTG AATCTTCACC ATCTTTCTGA 
AGGAGAAGTT GAGGGGAGCA TGCTTCCCTG 
AGGCATGCTG GAGTACCCAG CCTATCAGGT 
GTGCCGAGCC AGCAAGAGGA AGGGGTGCTG 



CAGCCAGGGC AGGCAGAGCA 2600 
GAGGCTTTTG CTGCTGCAGC 2650 
CCACAGGCAG AGGGCATCGG 2700 
GCTGGCTGAC TAAGCCCCGT 2750 
GGCTCCTGTG GTAGTCCTCC 2800 
AATCACCCAA TCCCAGTTCT 2850 
TCAGTTTAGG CCTTGGCTTG 2900 
CCCAGGCAGC TGGCAGGAGT 2950 
GTGTGGATGG CAGTGTGGGG 3000 
TCCTGCTGAG CTGCCCCTGC 3050 
GCCTGGTGGC CCAGCTATTA 3100 
CCCCTCCAAG TCACAAAGAG 3150 
GGTTGGTAAG GGGTTGGTAT 3200 
TTCCGCACCC TGCCTACGCC 3250 
CAGCTGACCG GGTCACACAA 3300 
GCCCCTCTTC CAAAGGCAGC 3350 
TGAGGCTTGC CCAGGAGCAA 3400 
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GTGAGGCCGG AGAGGAGTTC AGGAACCCTT CTCCATACCC ACAATCTGAG 3450 



CACGCTACCA AATCTCAAAA TATCCTAAGA CTAACAAAGG CAGCTGTGTC 3500 
TGAGCCCAAC CCTTCTAAAC GGTGACCTTT AGTGCCAACT TCCCCTCTAA 3550 
CTGGACAGCC TCTTCTGTCC CAAGTCTCCA GAGAGAAATC AGGCCTGATG 3600 
AGGGGGAATT C 3611 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 890 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ala Leu Arg Arg Ser Met Gly Arg Pro Gly Leu Pro Pro Leu 
1 5 * 10 15 

Pro Leu Pro Pro Pro Pro Arg Leu Gly Leu Leu Leu Ala Ala Leu 

20 25 30 

Ala Ser Leu Leu Leu Pro Glu Ser Ala Ala Ala Gly Leu Lys Leu 

35 40 45 

Met Gly Ala Pro Val Lys Leu Thr Val Ser Gin Gly Gin Pro Val 

50 55 60 

Lys Leu Asn Cys Ser Val Glu Gly Met Glu Glu Pro Asp lie Gin 

65 70 75 

Trp Val Lys Asp Gly Ala Val Val Gin Asn Leu Asp Gin Leu Tyr 

80 85 90 

lie Pro Val Ser Glu Gin His Trp lie Gly Phe Leu Ser Leu Lys 

95 100 105 

Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr Trp Cys Gin Val Glu 

110 115 120 

Asp Gly Gly Glu Thr Glu lie Ser Gin Pro Val Trp Leu Thr Val 

125 130 135 
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Glu Gly Val Pro Phe Phe 

140 



Thr Val Glu Pro Lys Asp Leu Ala Val 
145 150 



Pro Pro Asn Ala Pro Phe 

155 



Gin Leu Ser Cys Glu Ala Val Gly Pro 
160 * 165 



Pro Glu Pro Val Thr lie 

170 



Val Trp Trp Arg Gly Thr Thr Lys lie 
175 180 



Gly Gly Pro Ala Pro Ser 

185 



Pro Ser Val Leu Asn Val Thr Gly Val 
190 195 



Thr Gin Ser Thr Met Phe 

200 



Ser Cys Glu Ala His Asn Leu Lys Gly 
205 210 



Leu Ala Ser Ser Arg Thr 

215 



Ala Thr Val His Leu Gin Ala Leu Pro 
220 225 



Ala Ala Pro Phe Asn lie 

230 



Thr Val Thr Lys Leu Ser Ser Ser Asn 
235 240 



Ala Ser Val Ala Trp Met 

245 



Pro Gly Ala Asp Gly Arg Ala Leu Leu 
250 255 



Gin Ser Cys Thr Val Gin 

■ ' 260 



Val Thr Gin Ala Pro Gly Gly Trp Glu 
265 270 



Val Leu Ala Val Val Val 

275 



Pro Val Pro Pro Phe Thr Cys Leu Leu 
280 285 



Arg Asp Leu Val Pro Ala 

290 



Thr Asn Tyr Ser Leu Arg Val Arg Cys 
295 ~ 300 



Ala Asn Ala Leu Gly Pro 

305 



Ser Pro Tyr Ala Asp Trp Val Pro Phe 
310 315 



Gin Thr Lys Gly Leu Ala 

320 



Pro Ala Ser Ala Pro Gin Asn Leu His 
325 330 



Ala lie Arg Thr Asp Ser 

335 



Gly Leu lie Leu Glu Trp Glu Glu Val 
340 345 



He Pro Glu Ala Pro Leu 

350 



Glu Gly Pro Leu Gly Pro Tyr Lye Leu 
355 " 360 



Ser Trp Val Gin Asp Asn 

365 



Gly Thr Gin Asp Glu Leu Thr Val Glu 
370 375 



Gly Thr Arg Ala Asn Leu 

380 



Thr Gly Trp Asp Pro Gin Lys Asp Leu 
385 390 
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lie Val Arg Val 



Ser Gin Pro Leu 



Gly Pro Pro His 



Val Leu Thr Ala 



Leu Arg Lys Arg 



Ser Val Met Ala 



Arg Ser Phe Asn 



Asp Ser Leu Gly 



Val Leu He Pro 



Lys Gly Glu Phe 



Asp Gly Ser Phe 



He He Ala Ser 



Cys Met Lys Glu 



Val Ser Leu Arg 



Val He Leu Pro 



Leu Ala Ser Arg 



Thr Leu He Arg 



Cys Val Ser Asn Ala Val Gly Cys Gly Pro Trp 
395 400 405 

Val Val Ser Ser His Asp Arg Ala Gly Gin Gin 
410 415 420 

Ser Arg Thr Ser Trp Val Pro Val Val Leu Gly 
425 430 435 

Leu Val Thr Ala Ala Ala Leu Ala Leu He Leu 
440 445 450 

Arg Lys Glu Thr Arg Phe Gly Gin Ala Phe Asp 
455 460 465 

Arg Gly Glu Pro Ala Val His Phe Arg Ala Ala 
470 475 480 

Arg Glu Arg Pro Glu Arg He Glu Ala Thr Leu 
485 490 495 

He Ser Asp Glu Leu Lys Glu Lys Leu Glu Asp 
500 505 510 

Glu Gin Gin Phe Thr Leu Gly Arg Met Leu Gly 
515 520 525 

Gly Ser Val Arg Glu Ala Gin Leu Lys Gin Glu 
530 535 540 

Val Lys Val Ala Val Lys Met Leu Lys Ala Asp 
545 550 555 

Ser Asp He Glu Glu Phe Leu Arg Glu Ala Ala 
560 565 570 

Phe Asp His Pro His Val Ala Lys Leu Val Gly 
575 580 585 

Ser Arg Ala Lys Gly Arg Leu Pro He Pro Met 
590 595 600 

Phe Met Lys His Gly Asp Leu His Ala Phe Leu 
605 610 615 

He Gly Glu Asn Pro Phe Asn Leu Pro Leu Gin 
620 625 630 

Phe Met Val Asp He Ala Cys Gly Met Glu Tyr 
635 640 645 
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Leu Ser Ser Arg Asn Phe lie His Arg Asp Leu Ala Ala Arg Asn 

650 655 660 

Cys Met Leu Ala Glu Asp Met Thr Val Cys Val Ala Asp Phe Gly 

665 670 675 

Leu Ser Arg Lys lie Tyr Ser Gly Asp Tyr Tyr Arg Gin Gly Cys 

680 685 690 

Ala Ser Lys Leu Pro Val Lys Trp Leu Ala Leu Glu Ser Leu Ala 

695 * 700 705 

Asp Asn Leu Tyr Thr Val Gin Ser Asp Val Trp Ala Phe Gly Val 

710 715 720 

Thr Met Trp Glu lie Met Thr Arg Gly Gin Thr Pro Tyr Ala Gly 

725 730 735 

He Glu Asn Ala Glu He Tyr Asn Tyr Leu He Gly Gly Asn Arg 

740 745 750 

Leu Lys Gin Pro Pro Glu Cys Met Glu Asp Val Tyr Asp Leu Met 

755 760 765 

Tyr Gin Cys Trp Ser Ala Asp Pro Lys Gin Arg Pro Ser Phe Thr 

770 775 780 

Cys Leu Arg Met Glu Leu Glu Asn He Leu Gly Gin Leu Ser Val 

785 790 795 

Leu Ser Ala Ser Gin Asp Pro Leu Tyr He Asn He Glu Arg Ala 

800 805 810 

Glu Glu Pro Thr Ala Gly Gly Ser Leu Glu Leu Pro Gly Arg Asp 

815 820 825 

Gin Pro Tyr Ser Gly Ala Gly Asp Gly Ser Gly Met Gly Ala Val 

830 835 840 

Gly Gly Thr Pro Ser Asp Cys Arg Tyr He Leu Thr Pro Gly Gly 

845 " 850 855 

Leu Ala Glu Gin Pro Gly Gin Ala Glu His Gin Pro Glu Ser Pro 

860 865 870 

Leu Asn Glu Thr Gin Arg Leu Leu Leu Leu Gin Gin Gly Leu Leu 

875 880 885 

Pro His Ser Ser Cys 

890 
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(2) INFORMATION FOR SEQ ID NO: 3: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3637 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAATTCTCGA GTCGACGTTG GACTTGAAGG AATGCCAAGA GATGCTGCCC 50 
CCACCCCCTT AGGCCCGAGG GATCAGGAGC TATGGGACCA GAGGCCCTGT 100 
CATCTTTACT GCTGCTGCTC TTGGTGGCAA GTGGAGATGC TGACATGAAG 150 
GGACATTTTG ATCCTGCCAA GTGCCGCTAT GCCCTGGGCA TGCAGGACCG 200 
GACCATCCCA GACAGTGACA TCTCTGCTTC CAGCTCCTGG TCAGATT CCA 250 
CTGCCGCCCG CCACAGCAGG TTGGAGAGCA GTGACGGGGA TGGGGCCTGG 300 
TGCCCCGCAG GGTCGGTGTT TCCCAAGGAG GAGGAGTACT TGCAGGTGGA 350 
TCTACAACGA CTGCACCTGG TGGCTCTGGT GGGCACCCAG GGACGGCATG 4 00 
CCGGGGGCCT GGGCAAGGAG TTCTCCCGGA GCTACCGGCT GCGTTACTCC 450 

CGGGATGGTC GCCGCTGGAT GGGCTGGAAG GACCGCTGGG GTCAGGAGGT 500 

j 

GATCTCAGGC AATGAGGACC CTGAGGGAGT GGTGCTGAAG GACCTTGGGC 550 
CCCCCATGGT TGCCCGACTG GTTCGCTTCT ACCCCCGGGC TGACCGGGTC 600 
ATGAGCGTCT GTCTGCGGGT AGAGCTCTAT GGCTGCCTCT GGAGGGATGG 650 
ACTCCTGTCT TACACCGCCC CTGTGGGGCA GACAATGTAT TTATCTGAGG 700 
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CCGTGTACCT CAACGACTCC ACCTATGACG GACATACCGT GGGCGGACTG 750 
CAGTATGGGG GTCTGGGCCA GCTGGCAGAT GGTGTGGTGG GGCTGGATGA 800 

5 

CTTTAGGAAG AGTCAGGAGC TGCGGGTCTG GCCAGGCTAT GACTATGTGG 850 
10 GATGGAGCAA CCACAGCTTC TCCAGTGGCT ATGTGGAGAT GGAGTTTGAG 900 

TTTGACCGGC TGAGGGCCTT CCAGG CTATG CAGGTCCACT GTAACAACAT 950 

15 

GCACACGCTG GGAGCCCGTC TGCCTGGCGG GGTGGAATGT CGCTTCCGGC 1000 
GTGGCCCTGC CATGGCCTGG GAGGGGGAGC CCATGCGCCA CAACCTAGGG 1050 

20 

GGCAACCTGG GGGACCCCAG AGCCCGGGCT GTCTCAGTGC CCCTTGGCGG 1100 
25 CCGTGTGGCT CGCTTTCTGC AGTGCCGCTT CCTCTTTGCG GGGCCCTGGT 1150 

TACTCTTCAG CGAAATCTCC TTCATCTCTG ATGTGGTGAA CAATTCCTCT 1200 

30 

CCGGCACTGG GAGGCACCTT CCCGCCAGCC CCCTGGTGGC CGCCTGGCCC 1250 
ACCTCCCACC AACTTCAGCA GCTTGGAGCT GGAGCCCAGA GGCCAGCAGC 1300 

35 

CCGTGGCCAA GCCCGAGGGG AGCCCGACCG CCATCCTCAT CGGCTGCCTG 1350 
40 GTGGCCATCA TCCTGCTCCT GCTGCTCATC ATTGCCCTCA TGCTCTGGCG 1400 

GCTGCACTGG CGCAGGCTCC TCAGCAAGGC TGAACGGAGG GTGTTGGAAG 1450 

45 

AGGAGCTGAC GGTTCACCTC TCTGTCCCTG GGGACACTAT CCTCATCAAC 1500 
AACCGCCCAG GTCCTAGAGA GCCACCCCCG TACCAGGAGC CCCGGCCTCG 1550 

50 

TGGGAATCCG CCCCACTCCG CTCCCTGTGT CCCCAATGGC TCTGCGTTGC 1600 
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TGCTCTCCAA TCCAGCCTAC CGCCTCCTTC TGGCCACTTA CGCCCGTCCC 1650 
CCTCGAGGCC CGGGCCCCCC CACACCCGCC TGGGCCAAAC CCACCAACAC 1700 
CCAGGCCTAC AGTGGGGACT ATATGGAGCC TGAGAAGCCA GGCGCCCCGC 1750 
TTCTGCCCCC ACCTCCCCAG AACAGCGTCC CCCATTATGC CGAGGCTGAC 1800 
ATTGTTACCC TGCAGGGCGT CACCGGGGGC AACACCTATG CTGTGCCTGC 1850 
ACTGCCCCCA GGGGCAGTCG GGGATGGGCC CCCCAGAGTG GATTTCCCTC 1900 
GATCTCGACT CCGCTTCAAG GAGAAGCTTG GCGAGGGCCA GTTTGGGGAG 1950 
GTGCACCTGT GTGAGGT CGA CAGCCCTCAA GATCTGGTCA GTCTTGATTT 2 000 
CCCCGTTAAT GTGGGTAAGG GACACCCTTT GCTGGTAGCT GTCAAGATCT 2050 
TACGGCCAGA TGCCACCAAG AATGCCAGGA ATGATTTCCT GAAAGAGGTG 2100 
AAGATCATGT CGAGGCTCAA GGACCCAAAC ATCATTCGGC TGCTGGGCGT 2150 
GTGTGTGCAG GACGACCCCC TCTGCATGAT TACTGACTAC ATGGAGAACG 2200 
GCGACCTCAA CCAGTTCCTC AGTGCCCACC AGCTGGAGGA CAAGGCAGCC 2250 
GAGGGGGCCC CTGGGGACGG GCAGGCTGCG CAGGGGCCCA CCATCAGCTA 2300 
CCCAATGCTG CTGCATGTGG CAGCCCAGAT CGCCTCCGGC ATGCGCTATC 2350 
TGGCCACACT CAACTTTGTA CATCGGGACC TGGCCACGCG GAACTGCCTA 2400 
GTTGGGGAAA ATTTCACCAT CAAAATCGCA GACTTTGGCA TGAGCCGGAA 2450 
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CCTCTATGCT GGGGACTATT ACCGTGTGCA GGGCCGGGCA GTGCTGCCCA 2500 

TCCGCTGGAT GGCCTGGGAG TGCATCCTCA TGGGGAAGTT CACGACTGCG 2550 

AGTGACGTGT GGGCCTTTGG TGTGACCCTG TGGGAGGTGC TGATGCTCTG 2600 

TAGGGCCCAG CCCTTTGGGC AGCTCACCGA CGAGCAGGTC ATCGAGAACG 2650 

CGGGGGAGTT CTTCCGGGAC CAGGGCCGGC AGGTGTACCT GTCCCGGCCG 2700 

CCTGCCTGCC CGCAGGGCCT ATATGAGCTG ATGCTTCGGT GCTGGAGCCG 2750 

GGAGTCTGAG CAGCGACCAC CCTTTTCCCA GCTGCATCGG TTCCTGGCAG 2800 

AGGATGCACT CAACACGGTG TGAATCACAC ATCCAGCTGC CCCTCCCTCA 2850 

GGGAGTGATC CAGGGGAAGC CAGTGACACT AAAACAAGAG GACACAATGG 2900 

CACCTCTGCC CTTCCCCTCC CGACAGCCCA TCACCTCTAA TAGAGGCAGT 2950 

GAGACTGCAG AAGCCCCTGT CGCCCACCCA GCTGGTCCTG TGGATGGGAT 3000 

CCTCTCCACC CTCCTCTAGC CATCCCTTGG GGAAGGGTGG GGAGAAATAT 3 050 

AGGATAGACA CTGGACATGG CCCATTGGAG CACCTGGGCC CCACTGGACA 3100 

ACACTGATTC CTGGAGAGGT GGCTGCGCCC CCAGCTTCTC TCTCCCTGTC 3150 

ACACACTGGA CCCCACTGGC TGAGAATCTG GGGGTGAGGA GGACAAGAAG 3200 

GAGAGGAAAA TGTTTCCTTG TGCCTGCTCC TGTACTTGTC CTCAGCTTGG 3250 

GCTTCTTCCT CCTCCATCAC CTGAAACACT GGACCTGGGG GTAGCCCCGC 33 00 

CCCAGCCCTC AGTCACCCCC ACTTCCCACC TGCAGTCTTG TAGCTAGAAC 3350 
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TTCTCTAAGC CTATACGTTT CTGTGGAGTA AATATTGGGA TTGGGGGGAA 3400 



AGAGGGAGCA ACGGCCCATA GCCTTGGGGT TGGACATCTC TAGTGTAGCT 3450 

( 

GCCACATTGA TTTTTCTATA ATCACTTGGG GTTTGTACAT TTTTGGGGGG 3500 
AGAGACACAG ATTTTTACAC TAATATATGG ACCTAGCTTG AGGCAATTTT 3550 
AATCCCCTGC ACTAGGCAGG TAATAATAAA GGTTGAGTTT TCCACAAAAA 3600 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAA 3637 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 913 amino acids 

(B) TYPE: amino acid 
- (D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Pro Glu Ala Leu Ser Ser Leu Leu Leu Leu Leu Leu Val 

1^5 10 15 

Ala Ser Gly (Asp) Ala Asp Met Lys Gly His Phe Asp Pro Ala Lys 

y 20 25 30 

Cys Arg Tyr Ala Leu Gly Met Gin Asp Arg Thr lie Pro Asp Ser 

35 , 40 45 

Asp lie Ser Ala Ser Ser Ser Trp Ser Asp Ser Thr Ala Ala Arg 

50 55 60 

His Ser Arg Leu Glu Ser Ser Asp Gly Asp Gly Ala Trp Cys Pro 

65 70 75 

Ala Gly Ser Val Phe Pro Lys Glu Glu Glu Tyr Leu Gin Val Asp 

80 85 90 

Leu Gin Arg Leu His Leu Val Ala Leu Val Gly Thr Gin Gly Arg 

95 100 105 

His Ala Gly Gly Leu Gly Lys Glu Phe Ser Arg Ser Tyr Arg Leu 

110 115 120 
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Arg Tyr Ser Arg Asp Gly Arg Arg Trp Met Gly Trp Lys Asp Arg 

125 130 135 

Trp Gly Gin Glu Val lie Ser Gly Asn Glu Asp Pro Glu Gly Val 

5 " 140 145 150 

Val Leu Lys Asp Leu Gly Pro Pro Met Val Ala Arg Leu Val Arg 

155 * 160 165 

10 Phe Tyr Pro Arg Ala Asp Arg Val Met Ser Val Cys Leu Arg Val 

170 175 180 



15 



30 



45 



Glu Leu Tyr Gly Cys Leu Trp Arg Asp Gly Leu Leu Ser Tyr Thr 

185 190 195 

Ala Pro Val Gly Gin Thr Met Tyr Leu Ser Glu Ala Val Tyr Leu 

200 205 210 



Asn Asp Ser Thr Tyr Asp Gly His Thr Val Gly Gly Leu Gin Tyr 

20 215 220 225 

Gly Gly Leu Gly Gin Leu Ala Asp Gly Val Val Gly Leu Asp Asp 

230 235 240 

25 Phe Arg Lys Ser Gin Glu Leu Arg Val Trp Pro Gly Tyr Asp Tyr 

245 250 255 



Val Gly Trp Ser Asn His Ser Phe Ser Ser Gly Tyr Val Glu Met 

260 265 270 

Glu Phe Glu Phe Asp Arg Leu Arg Ala Phe Gin Ala Met Gin Val 

275 280 285 



His Cys Asn Asn Met His Thr Leu Gly Ala Arg Leu Pro Gly Gly 

35 290 295 300 

Val Glu Cys Arg Phe Arg Arg Gly Pro Ala Met Ala Trp Glu Gly 

305 310 315 

4 0 Glu Pro Met Arg His Asn Leu Gly Gly Asn Leu Gly Asp Pro Arg 

320 325 330 



Ala Arg Ala Val Ser Val Pro Leu Gly Gly Arg Val Ala Arg Phe 

335 340 345 

Leu Gin Cys Arg Phe Leu Phe Ala Gly Pro Trp Leu Leu Phe Ser 

350 355 360 



Glu lie Ser Phe lie Ser Asp Val Val Asn Asn Ser Ser Pro Ala 
50 365 370 375 
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Leu Gly Gly Thr Phe Pro Pro Ala Pro Trp Tip Pro Pro Gly Pro 

380 385 ~ 390 

Pro Pro Thr Asn Phe Ser Ser Leu Glu Leu Glu Pro Arg Gly Gin 

395 400 405 

Gin Pro Val Ala Lys Pro Glu Gly Ser Pro Thr Ala) lie Leu lie 

410 * 415 ^3 420 

Gly Cys Leu Val Ala lie lie Leu Leu Leu Leu Leu lie lie Ala 

425 430 435 

Leu Met Leu Trp Arg Leu His Trp Arg Arg Leu Leu Ser Lys Ala 

440 445 450 

Glu Arg Arg Val Leu Glu Glu Glu Leu Thr Val His Leu Ser Val 

455 460 465 

Pro Gly Asp Thr lie Leu lie Asn Asn Arg Pro Gly Pro Arg Glu 

470 475 480 

Pro Pro Pro Tyr Gin Glu Pro Arg Pro Arg Gly Asn Pro Pro His 

485 490 " 495 

Ser Ala Pro Cys Val Pro Asn Gly Ser Ala Leu Leu Leu Ser Asn 

500 505 510 

Pro Ala Tyr Arg Leu Leu Leu Ala Thr Tyr Ala Arg Pro Pro Arg 

515 520 " 525 

Gly Pro Gly Pro Pro Thr Pro Ala Trp Ala Lys Pro Thr Asn Thr 

530 535 540 

Gin Ala Tyr Ser Gly Asp Tyr Met Glu Pro Glu Lys Pro Gly Ala 

545 550 555 

Pro Leu Leu Pro Pro Pro Pro Gin Asn Ser Val Pro His Tyr Ala 

560 565 570 

Glu Ala Asp lie Val Thr Leu Gin Gly Val Thr Gly Gly Asn Thr 

575 580 585 

Tyr Ala Val Pro Ala Leu Pro Pro Gly Ala Val Gly Asp Gly Pro 

590 595 * " 600 

Pro Arg Val Asp Phe Pro Arg Ser Arg Leu Arg Phe Lys Glu Lys 

605 610 615 

Leu Gly Glu Gly Gin Phe Gly Glu Val His Leu Cys Glu Val Asp 

620 625 630 
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Ser Pro Gin Asp Leu Val Ser Leu Asp Phe Pro Leu Asn Val Arg 

635 640 645 

Lys Gly His Pro Leu Leu Val Ala Val Lys lie Leu Arg Pro Asp 

650 655 660 

Ala Thr Lys Asn Ala Arg Asn Asp Phe Leu Lys Glu Val Lys lie 

665 ' 670 675 

Met Ser Arg Leu Lys Asp Pro Asn lie He Arg Leu Leu Gly Val 

680 685 690 

Cys Val Gin Asp Asp Pro Leu Cys Met He Thr Asp Tyr Met Glu 

695 * 700 705 

Asn Gly Asp Leu Asn Gin Phe Leu Ser Ala His Gin Leu Glu Asp 

710 715 720 

Lys Ala Ala Glu Gly Ala Pro Gly Asp Gly Gin Ala Ala Gin Gly 

725 * 730 735 

Pro Thr He Ser Tyr Pro Met Leu Leu His Val Ala Ala Gin He 

740 745 750 

Ala Ser Gly Met Arg Tyr Leu Ala Thr Leu Asn Phe Val His Arg 

- — - 755 - -- - 760 765 

Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu Asn Phe Thr He 

770 775 780 

Lys He Ala Asp Phe Gly Met Ser Arg Asn Leu Tyr Ala Gly Asp 

785 790 795 

Tyr Tyr Arg Val Gin Gly Arg Ala Val Leu Pro He Arg Trp Met 

800 805 810 

Ala Trp Glu Cys He Leu Met Gly Lys Phe Thr Thr Ala Ser Asp 

815 820 825 

Val Trp Ala Phe Gly Val Thr Leu Trp Glu Val Leu Met Leu Cys 

830 835 840 

Arg Ala Gin Pro Phe Gly Gin Leu Thr Asp Glu Gin Val He Glu 

845 850 • 855 

Asn Ala Gly Glu Phe Phe Arg Asp Gin Gly Arg Gin Val Tyr Leu 

860 865 870 

Ser Arg Pro Pro Ala Cys Pro Gin Gly Leu Tyr Glu Leu Met Leu 

875 880 885 
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Arg Cys Trp Ser Arg Glu Ser Glu Gin Arg Pro Pro Phe Ser Gin 

890 895 900 

Leu His Arg Phe Leu Ala Glu Asp Ala Leu Asn Thr Val 
5 905 " 910 913 

(2) INFORMATION FOR SEQ ID NO: 5: 

* 

(i) SEQUENCE CHARACTERISTICS: 
.0 (A) LENGTH: 1164 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

.5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GCAGGTCTGA AGCTCATGGG AGCCCCGGTG AAGCTGACAG TGTCTCAGGG 50 

20 

GCAGCCGGTG AAGCTCAACT GCAGTGTGGA GGGGATGGAG GAGCCTGACA 100 



TCCAGTGGGT GAAGGATGGG GCTGTGGTCC AGAACTTGGA CCAGTTGTAC 150 

25 

ATCCCAGTCA GCGAGCAGCA CTGGATCGGC TTCCTCAGCC TGAAGTCAGT 200 



30 GGAGCGCTCT GACGCCGGCC GGTACTGGTG CCAGGTGGAG GATGGGGGTG 250 



AAACCGAGAT CTCCCAGCCA GTGTGGCTCA CGGTAGAAGG TGTGCCATTT 300 

35 

TTCACAGTGG AGCCAAAAGA TCTGGCAGTG CCACCCAATG CCCCTTTCCA 350 



ACTGTCTTGT GAGGCTGTGG GTCCCCCTGA ACCTGTTACC ATTGTCTGGT 400 

40 

GGAGAGGAAC TACGAAGATC GGGGGACCCG CTCCCTCTCC ATCTGTTTTA 450 



45 AATGTAACAG GGGTGACCCA GAGCACCATG TTTTCCTGTG AAGCTCACAA 500 



CCTAAAAGGC CTGGCCTCTT CTCGCACAGC CACTGTTCAC CTTCAAGCAC 550 

50 

TGCCTGCAGC CCCCTTCAAC ATCACCGTGA CAAAGCTTTC CAGCAGCAAC 600 
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GCTAGTGTGG CCTGGATGCC AGGTG CTGAT GGCCGAGCTC TGCTACAGTC 65 0 
CTGTACAGTT CAGGTGACAC AGGCCCCAGG AGGCTGGGAA GTCCTGGCTG 700 
TTGTGGTCCC TGTGCCCCCC TTTACCTGCC TGCTCCGGGA CCTGGTGCCT 750 
GCCACCAACT ACAGCCTCAG GGTGCGCTGT GCCAATGCCT TGGGGCCCTC 800 
TCCCTATGCT GACTGGGTGC CCTTTCAGAC CAAGGGTCTA GCCCCAGCCA 850 
GCGCTCCCCA AAACCTCCAT GCCATCCGCA CAGATTCAGG CCTCATCTTG 900 
GAGTGGGAAG AAGTGATCCC CGAGGCCCCT TTGGAAGGCC CCCTGGGACC 950 
CTACAAACTG TCCTGGGTTC AAGACAATGG AACCCAGGAT GAGCTGACAG 1000 
TGGAGGGGAC CAGGGCCAAT TTGACAGGCT GGGATCCCCA AAAGGACCTG 1050 
ATCGTACGTG TGTGCGTCTC CAATGCAGTT GGCTGTGGAC CCTGGAGTCA 1100 
GCCACTGGTG GTCTCTTCTC ATGACCGTGC AGGCCAGCAG GGCCCTCCTC 1150 
ACAGCCGCAC ATCC 1164 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Leu Thr Val Ser 
1*5 10 15 

Gin Gly Gin Pro Val Lys Leu Asn Cys Ser Val Glu Gly Met Glu 

20 25 30 
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Glu Pro Asp lie Gin Trp Val Lys Asp Gly Ala Val Val Gin Asn 

35 40 45 

Leu Asp Gin Leu Tyr lie Pro Val Ser Glu Gin His Trp lie Gly 

50 55 60 

Phe Leu Ser Leu Lys Ser Val Glu Arg Ser Asp Ala Gly Arg Tyr 

65 • 70 75 

Trp Cys Gin Val Glu Asp Gly Gly Glu Thr Glu lie Ser Gin Pro 

80 85 90 

Val Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu Pro 

95 100 105 

Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gin Leu Ser Cys 

110 115 120 

Glu Ala Val Gly Pro Pro Glu Pro Val Thr lie Val Trp Trp Arg 

125 130 135 

Gly Thr Thr Lys lie Gly Gly Pro Ala Pro Ser Pro Ser Val Leu 

140 145 150 

Asn Val Thr Gly Val Thr Gin Ser Thr Met Phe Ser Cys Glu Ala 

155 160 165 

His Asn Leu Lys Gly Leu Ala Ser Ser Arg Thr Ala Thr Val His 

170 175 180 

Leu Gin Ala Leu Pro Ala Ala Pro Phe Asn lie Thr Val Thr Lys 

185 190 195 

Leu Ser Ser Ser Asn Ala Ser Val Ala Trp Met Pro Gly Ala Asp 

200 205 210 

Gly Arg Ala Leu Leu Gin Ser Cys Thr Val Gin Val Thr Gin Ala 

215 220 225 

Pro Gly Gly Trp Glu Val Leu Ala Val Val Val Pro Val Pro Pro 

230 235 240 

Phe Thr Cys Leu Leu Arg Asp Leu Val Pro Ala Thr Asn Tyr Ser 

245 250 255 

Leu Arg Val Arg Cys Ala Asn Ala Leu Gly Pro Ser Pro Tyr Ala 



260 



265 



270 



Asp Trp Val Pro Phe Gin 

275 



Thr Lys Gly Leu Ala Pro Ala Ser Ala 
280 285 
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Pro Gin Asn Leu His Ala lie Arg Thr Asp Ser Gly Leu lie Leu 

290 295 300 

Glu Trp Glu Glu Val lie Pro Glu Ala Pro Leu Glu Gly Pro Leu 

305 310 315 

Gly Pro Tyr Lys Leu Ser Trp Val Gin Asp Asn Gly Thr Gin Asp 

320 ' 325 330 

Glu Leu Thr Val Glu Gly Thr Arg Ala Asn Leu Thr Gly Trp Asp 

335 340 345 

Pro Gin Lys Asp Leu lie Val Arg Val Cys Val Ser Asn Ala Val 

350 " 355 360 

Gly Cys Gly Pro Trp Ser Gin Pro Leu Val Val Ser Ser His Asp 

365 370 375 

Arg Ala Gly Gin Gin Gly Pro Pro His Ser Arg Thr Ser 

380 385 388 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1197 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



GATG CTGACA TGAAGGGACA TTTTGATCCT GCCAAGTGCC GCTATGCCCT 50 
GGGCATGCAG GACCGGACCA TCCCAGACAG TGACATCTCT GCTTCCAGCT 100 
CCTGGTCAGA TTCCACTGCC GCCCGCCACA GCAGGTTGGA GAGCAGTGAC 150 



GGGGATGGGG CCTGGTGCCC CGCAGGGTCG GTGTTTCCCA AGGAGGAGGA 200 
GTACTTGCAG GTGGATCTAC AACGACTGCA CCTGGTGGCT CTGGTGGGCA 250 
CCCAGGGACG GCATGCCGGG GGCCTGGGCA AGGAGTTCTC CCGGAGCTAC 300 
CGGCTGCGTT ACTCCCGGGA TGGTCGCCGC TGGATGGGCT GGAAGGACCG 350 
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CTGGGGTCAG GAGGTGATCT CAGGCAATGA GGACCCTGAG GGAGTGGTGC 400 

TGAAGGACCT TGGGCCCCCC ATGGTTGCCC GACTGGTTCG CTTCTACCCC 450 

CGGGCTGACC GGGTCATGAG CGTCTGTCTG CGGGTAGAGC TCTATGGCTG 500 

CCTCTGGAGG GATGGACTCC TGTCTTACAC CGCCCCTGTG GGGCAGACAA 550 

TGTATTTATC TGAGGCCGTG TACCTCAACG ACTCCACCTA TGACGGACAT 600 

ACCGTGGGCG GACTGCAGTA TGGGGGTCTG GGCCAGCTGG CAGATGGTGT 650 

GGTGGGGCTG GATGACTTTA GGAAGAGTCA GGAGCTGCGG GTCTGGCCAG 700 

GCTATGACTA TGTGGGATGG AGCAACCACA GCTTCTCCAG TGGCTATGTG 750 

GAGATGGAGT TTGAGTTTGA CCGGCTGAGG GCCTTCCAGG CTATGCAGGT 800 

CCACTGTAAC AACATGCACA CGCTGGGAGC CCGTCTGCCT GGCGGGGTGG 850 

AATGTCGCTT CCGGCGTGGC CCTGCCATGG CCTGGGAGGG GGAGCCCATG 900 

CGCCACAACC TAGGGGG CAA CCTGGGGGAC CCCAGAGCCC GGGCTGTCTC 950 

AGTGCCCCTT GGCGGCCGTG TGGCTCGCTT TCTGCAGTGC CGCTTCCTCT 1000 

TTGCGGGGCC CTGGTTACTC TTCAGCGAAA TCTCCTTCAT CTCTGATGTG 1050 

GTGAACAATT CCTCTCCGGC ACTGGGAGGC ACCTTCCCGC CAGCCCCCTG 1100 

GTGGCCGCCT GGCCCACCTC CCACCAACTT CAGCAGCTTG GAGCTGGAGC 1150 
CCAGAGGCCA GCAGCCCGTG GCCAAGCCCG AGGGGAGCCC GACCGCC 1197 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 99 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Asp Ala Asp Met Lys Gly His Phe Asp Pro Ala Lys Cys Arg Tyr 
15 10 15 

Ala Leu Gly Met Gin Asp Arg Thr lie Pro Asp Ser Asp lie Ser 

20 25 30 

Ala Ser Ser Ser Trp Ser Asp Ser Thr Ala Ala Arg His Ser Arg 

35 40 45 

Leu Glu Ser Ser Asp Gly Asp Gly Ala Trp Cys Pro Ala Gly Ser 

50 55 60 

Val Phe Pro Lys Glu Glu Glu Tyr Leu Gin Val Asp Leu Gin Arg 

65 70 75 

Leu His Leu Val Ala Leu Val Gly Thr Gin Gly Arg His Ala Gly 

80 85 90 

Gly Leu Gly Lys Glu Phe Ser Arg Ser Tyr Arg Leu Arg Tyr Ser 

95 100 105 

Arg Asp Gly Arg Arg Trp Met Gly Trp Lys Asp Arg Trp Gly Gin 

110 115 120 

Glu Val lie Ser Gly Asn Glu Asp Pro Glu Gly Val Val Leu Lys 

125 130 135 

Asp Leu Gly Pro Pro Met Val Ala Arg Leu Val Arg Phe Tyr Pro 

140 145 150 

Arg Ala Asp Arg Val Met Ser Val Cys Leu Arg Val Glu Leu Tyr 

155 160 165 

Gly Cys Leu Trp Arg Asp Gly Leu Leu Ser Tyr Thr Ala Pro Val 

170 " * 175 180 

Gly Gin Thr Met Tyr Leu Ser Glu Ala Val Tyr Leu Asn Asp Ser 

185 190 195 

Thr Tyr Asp Gly His Thr Val Gly Gly Leu Gin Tyr Gly Gly Leu 

200 205 210 
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Gly Gin Leu Ala Asp Gly Val Val Gly Leu Asp Asp Phe Arg Lys 

215 220 ~ 225 

Ser Gin Glu Leu Arg Val Trp Pro Gly Tyr Asp Tyr Val Gly Trp 

230 * 235 * ~ 240 

Ser Asn His Ser Phe Ser Ser Gly Tyr Val Glu Met Glu Phe Glu 

245 250 255 

Phe Asp Arg Leu Arg Ala Phe Gin Ala Met Gin Val His Cys Asn 

260 265 270 

Asn Met His Thr Leu Gly Ala Arg. Leu Pro Gly Gly Val Glu Cys 

275 280 285 

Arg Phe Arg Arg Gly Pro Ala Met Ala Trp Glu Gly Glu Pro Met 

290 295 300 

Arg His Asn Leu Gly Gly Asn Leu Gly Asp Pro Arg Ala Arg Ala 

305 310 " " 315 

Val Ser Val Pro Leu Gly Gly Arg Val Ala Arg Phe Leu Gin Cys 

320 325 330 

Arg Phe Leu Phe Ala Gly Pro Trp Leu Leu Phe Ser Glu lie Ser 

335 340 345 

Phe lie Ser Asp Val Val Asn Asn Ser Ser Pro Ala Leu Gly Gly 

350 355 360 

Thr Phe Pro Pro Ala Pro Trp Trp Pro Pro Gly Pro Pro Pro Thr 

365 370 375 

Asn Phe Ser Ser Leu Glu Leu Glu Pro Arg Gly Gin Gin Pro Val 

380 385 * 390 

Ala Lys Pro Glu Gly Ser Pro Thr Ala 

395 399 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3785 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCTCCGCCAC CCTCCTCTCA GCGCTCGCGG GCCGGGCCCG GCATGGTGCG 50 
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CGTCGCCGCC GATGGCGCTG AGGCGGAGCA TGGGGTGGCC GGGGCTCCGG 100 

CCGCTGCTGC TGGCGGGACT GGCTTCTCTG CTGCTCCCCG GGTCTGCGGC 150 

CGCAGGCCTG AAGCTCATGG GCGCCCCAGT GAAGATGACC GTGTCTCAGG 200 

GGCAGCCAGT GAAGCTCAAC TGCAGCGTGG AGGGGATGGA GGACCCTGAC 250 

ATCCACTGGA TGAAGGATGG CACCGTGGTC CAGAATGCAA GCCAGGTGTC 300 

CATCTCCATC AGCGAGCACA GCTGGATTGG CTTACTCAGC CTAAAGTCAG 350 

TGGAGCGGTC TGATGCTGGC CTGTACTGGT GCCAGGTGAA GGATGGGGAG 400 

GAAACCAAGA TCTCTCAGTC AGTATGGCTC ACTGTCGAAG GTGTGCCATT 450 

CTTCACAGTG GAACCAAAAG ATCTGGCGGT GCCACCCAAT GCCCCTTTTC 500 

AGCTGTCTTG TGAGGCTGTG GGTCCTCCAG AACCCGTAAC CATTTACTGG 550 

TGGAGAGGAC TCACTAAGGT TGGGGGACCT GCTCCCTCTC CCTCTGTTTT 600 

AAATGTGACA GGAGTGACCC AGCGCACAGA GTTTTCTTGT GAAGCCCGCA 650 

ACATAAAAGG CCTGGCCACT TCCCGACCAG CCATTGTTCG CCTTCAAGCA 700 

CCGCCTGCAG CTCCTTTCAA CACCACAGTA ACAACGATCT CCAGCTACAA 750 

CGCTAGCGTG GCCTGGGTGC CAGGTGCTGA CGGCCTAGCT CTGCTGCATT 800 

CCTGTACTGT ACAGGTGGCA CACGCCCCAG GAGAATGGGA GGCCCTTGCT 850 

GTTGTGGTTC CTGTGCCACC TTTTACCTGC CTGCTTCGGA ACTTGGCCCC 900 

TGCCACCAAC TACAGCCTTA GGGTGCGCTG TGCCAATGCC TTGGGCCCTT 950 
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CTCCCTACGG CGACTGGGTG CCCTTTCAGA 
AGAGCTCCTC AGAATTTCCA TGCCATTCGT 
GGAATGGGAA GAAGTGATTC CTGAAGACCC 
CTTATAAGCT GTCCTGGGTC CAAGAAAATG 
GTGGAAGGGA CCAGGGCCAA TCTGACCGAC 
GATTTTGCGT GTGTGTGCCT CCAATGCAAT 
AGCCACTGGT GGTGTCTTCT CATGACCATG 
CACAGCCGCA CATCCTGGGT GCCTGTGGTC 
GATCACAGCT GCTGCCTTGG CCCTCATCCT 
AGACGCGTTT CGGGCPAGCC TTTGACAGTG 
GCTGTACACT TCCGGGCAGC CCGATCTTTC 
CATTGAGGCC ACATTGGATA GCCTGGGCAT 
AGCTGGAGGA TGTCCTCATT CCAGAGCAGC 
TTGGGCAAAG GAGAGTTTGG ATCAGTGCGG 
AGATGGCTCC TTCGTGAAAG TGGCAGTGAA 
TTGCCTCAAG CGACATAGAA GAGTTCCTCC 
GAGTTTGACC ATCCACACGT GGCCAAGCTT 



CAAAGGGCCT AGCGCCAGCC 1000 
ACCGACTCAG GCCTTATCCT 1050 
TGGGGAAGGC CCCCTAGGAC 1100 
GAACCCAGGA TGAGCTGATG 1150 
TGGGATCCCC AGAAGGACCT 1200 
TGGTGATGGG CCCTGGAGTC 1250 
CAGGGAGGCA GGGCCCTCCC 1300 
CTGGGCGTGC TCACCGCCCT 1350 
GCTTCGGAAG AGACGCAAGG 1400 
TCATGGCCCG AGGGGAGCCA 1450 
AATCGAGAAA GGCCTGAACG 1500 
CAGCGATGAA TTGAAGGAAA 1550 
AGTTCACCCT CGGTCGGATG 1600 
GAAGCCCAGC TAAAGCAGGA 1650 
GATGCTGAAA GCTGACATCA 1700 
GGGAAGCAGC TTGCATGAAG 1750 
GTTGGGGTGA GCCTCCGGAG 1800 
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CAGGGCTAAA GGTCGTCTCC CCATTCCCAT GGTCATCCTG CCCTTCATGA 1850 

AACATGGAGA CTTGCACGCC TTTCTGCTCG CCTCCCGAAT CGGGGAGAA.C 1900 

CCTTTTAACC TGCCCCTGCA GACCCTGGTC CGGTTCATGG TGGACATTGC 1950 

CTGTGGCATG GAGTACCTGA GCTCCCGGAA CTTCATCCAC CGAGACCTAG 2000 

CAGCTCGGAA TTGCATGCTG GCCGAGGACA TGACAGTGTG TGTGGCTGAT 2050 

TTTGGACTCT CTCGGAAAAT CTATAGCGGG GACTATTATC GTCAGGGCTG 2100 

TGCCTCCAAA TTGCCCGTCA AGTGGCTGGC CCTGGAGAGC TTGGCTGACA 2150 

ACTTGTATAC TGTACACAGT GATGTGTGGG CCTTCGGGGT GACCATGTGG 2200 

GAGATCATGA CTCGTGGGCA GACGCCATAT GCTGGCATTG AAAATGCTGA 2250 

GATTTACAAC TACCTCATCG GCGGGAACCG CCTGAAGCAG CCTCCGGAGT 23 00 

GCATGGAGGA AGTGTATGAT CTCATGTACC AGTGCTGGAG CGCCGACCCC 2350 

AAGCAGCGCC CAAGCTTCAC GTGTCTGCGA ATGGAACTGG AGAACATTCT 2400 

GGGCCACCTG TCTGTGCTGT CCACCAGCCA GGACCCCTTG TACATCAACA 2450 

TTGAGAGAGC TGAGCAGCCT ACTGAGAGTG GCAGCCCTGA GCTGCACTGT 2500 

GGAGAGCGAT CCAGCAGCGA GGCAGGGGAC GGCAGTGGCG TGGGGGCAGT 2550 

AGGTGGCATC CCCAGTGACT CTCGGTACAT CTTCAGCCCC GGAGGGCTAT 2600 

CCGAGTCACC AGGGCAGCTG GAGCAGCAGC CAGAAAGCCC CCTCAATGAG 2650 

AACCAGAGGC TGTTGTTGCT GCAGCAAGGG CTACTGCCTC ACAGTAGCTG 2700 

124 



TTAACCCTCA GGCAGAGGAA AGTTGGGGCC CCTGGCTCTG CTGACCGCTG 2750 
CGCTGCCTGA CTAGGCCCAG TCTGATCACA GCCCAGGCAG CAAGGTATGG 2800 
AC3GCTCCTGT GGTAGCCCTC CCAAGCTGTG TGGCGCCTGG ACGGACCAAA 2850 
TTGCCCAATC CCAGTTCTTC CTGCAGCCGC TCTGGCCAGC CTGGCATCAG 2900 
TTCAGGCCTT GGCTTAGAGG AGGTGAGCCA GAGCTGGTTG CCTGAATGCA 2950 
GGCAGCTGGC AGGAGGGGAG GGTGGCTATG TTTCCATGGG TACCATGGGT 3000 
GTGGATGGCA GTAAGGGAGG GTAGCAACAG CCTGTGGGCC CCTACCCTCC 3050 
TGGCTGAGCT GCTCCTACTT TAGTGCATGC TTGGAGCCGC CTGCAGCCTG 3100 
GAACTCAGCA CTGCCCACCA CACTTGGGCC GAAATGCCAG GTTTGCCCCT 3150 
CTTAAGTCAC AAAGAGATGT CCATGTATTG TTCCCTTTTA GGTGATGATT 3200 
AGGAAGGGAT TGGCACACTT GGGTCCCTAA GCCCTATGGC AGGAAATGGT 3250 
GGGATATTCT CAGGTCTGAA TCCTCATCAT CTTCCTGATT CCCCACCCTG 3300 
CAAAGGCCTG GAACTGGCTG TGGGGCTCTG ACGCATGCTG AAGGACAAAA 3350 
GGTTACAGAG ATCCGACTTC AAAAGGCAGG GTCTGAGTCT GGCAGGTGGA 3400 
GAGGTGCTAA GGGGCTGGCC CAGGAGTCAG GCATTTCAGG ACCCCTCCAA 3450 
GCTTCTACAG TCTGTCTGAG CATGCTACCA AGCCCCCAGA TACCCCAAAA 3500 
CTAACAGAGG CAGTTTTGTC TGAGCCCAGC CCTCCCACAT GATGACCCTT 3550 
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AGGTCTACCC TCCTCTCTAA ATGGACATCC TCGTTTGTCC CAAGTCTCCA 3600 
GAGAGACTAC TGATGGCTGA TGTGGGTAAG AAAAGTTCCA GGAACCAGGG 3 650 
CTGGGGTGGA ACCAGGGCTG GGGTCGAGGC AGGCTCTTGG GCAGGCTCTT 3700 
GCTGTTAGGA ACATTTCTAA GCTATTAAGT TGCTGTTTCA AAACAAATAA 3750 
AATTGAAACA TAAAGAATCA AAAAAAAAAA AAAAA 3785 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 880 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Leu Arg Arg Ser Met Gly Trp Pro Gly Leu Arg Pro Leu 
1 5 10 15 

Leu Leu Ala Gly Leu Ala Ser Leu Leu Leu Pro Gly Ser Ala Ala 

20 25 30 

Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Met Thr Val Ser 

35 40 45 

Gin Gly Gin Pro Val Lys Leu Asn Cys Ser ValGlu Gly Met Glu 

50 55 60 

Asp Pro Asp lie His Trp Met Lys Asp Gly Thr Val Val Gin Asn 

65 70 75 

Ala Ser Gin Val Ser He Ser He Ser Glu His Ser Trp He Gly 

80 85 90 

Leu Leu Ser Leu Lys Ser Val Glu Arg Ser Asp Ala Gly Leu Tyr 

95 100 105 

Trp Cvs Gin Val Lys Asp Gly Glu Glu Thr Lys He Ser Gin Ser 
* 110 ~ 115 120 

Val Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu Pro 

125 130 135 
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Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gin Leu Ser Cys 

140 145 150 

Glu Ala Val Gly Pro Pro Glu Pro Val Thr He Tyr Trp Trp Arg 
5 155 160 165 

Gly Leu Thr Lys Val Gly Gly Pro Ala Pro Ser Pro Ser Val Leu 

170 ' 175 180 

10 Asn Val Thr Gly Val Thr Gin Arg Thr Glu Phe Ser Cys Glu Ala 

185 190 195 



15 



30 



45 



Arg Asn He Lys Gly Leu Ala Thr Ser Arg Pro Ala He Val Arg 

200 205 210 

Leu Gin Ala Pro Pro Ala Ala Pro Phe Asn Thr Thr Val Thr Thr 

215 220 225 



lie Ser Ser Tyr Asn Ala Ser Val Ala Trp Val Pro Gly Ala Asp 
20 230 235 240 

Gly Leu Ala Leu Leu His Ser Cys Thr Val Gin Val Ala His Ala 

245 250 255 

25 Pro Gly Glu Trp Glu Ala Leu Ala Val Val Val Pro Val Pro Pro 

260 265 270 



Phe Thr Cys Leu Leu Arg Asn Leu Ala Pro Ala Thr Asn Tyr Ser 

275 280 " 285 

Leu Arg Val Arg Cys Ala Asn Ala Leu Gly Pro Ser Pro Tyr Gly 

290 295 * 300 



Asp Trp Val Pro Phe Gin Thr Lys Gly Leu Ala Pro Ala Arg Ala 
35 305 310 315 

Pro Gin Asn Phe His Ala He Arg Thr Asp Ser Gly Leu He Leu 

320 325 330 

40 Glu Trp Glu Glu Val He Pro Glu Asp Pro Gly Glu Gly Pro Leu 

335 340 * 345 



Gly Pro Tyr Lys Leu Ser Trp Val Gin Glu Asn Gly Thr Gin Asp 

350 355 360 

Glu Leu Met Val Glu Gly Thr Arg Ala Asn Leu Thr Asp Trp Asp 

365 370 375 



Pro Gin Lys Asp Leu He Leu Arg Val Cys Ala Ser Asn Ala He 
50 " 380 385 390 
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Gly Asp Gly Pro Trp Ser Gin Pro Leu Val Val Ser Ser His Asp 

395 400 405 

His Ala Gly Arg Gin Gly Pro Pro His Ser Arg Thr Ser -Trp Val 

410 415 420 

Pro Val Val Leu Gly Val Leu Thr Ala Leu lie Thr Ala Ala Ala 

425 . 430 435 

Leu Ala Leu lie Leu Leu Arg Lys Arg Arg Lys Glu Thr Arg Phe 

440 445 450 

Gly Gin Ala Phe Asp Ser Val Met Ala Arg Gly Glu Pro Ala Val 

455 460 465 

His Phe Arg Ala Ala Arg Ser Phe Asn Arg Glu Arg Pro Glu Arg 

470 475 480 

lie Glu Ala Thr Leu Asp Ser Leu Gly He Ser Asp Glu Leu Lys 

485 490 495 

Glu Lys Leu Glu Asp Val Leu He Pro Glu Gin Gin Phe Thr Leu 

500 505 510 

Gly Arg Met Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu Ala 

515 520 525 

Gin Leu Lys Gin Glu Asp Gly Ser Phe Val Lys Val Ala Val Lys 

530 535 540 

Met Leu Lys Ala Asp He He Ala Ser Ser Asp He Glu Glu Phe 

545 550 555 

Leu Arg Glu Ala Ala Cys Met Lys Glu Phe Asp His Pro His Val 

560 565 570 

Ala Lys Leu Val Gly Val Ser Leu Arg Ser Arg Ala Lys Gly Arg 

575 580 585 

Leu Pro He Pro Met Val He Leu Pro Phe Met Lys His Gly Asp 

590 595 600 

Leu His Ala Phe Leu Leu Ala Ser. Arg He Gly Glu Asn Pro Phe 

605 610 615 

Asn Leu Pro Leu Gin Thr Leu Val Arg Phe Met Val Asp lie Ala 

620 625 630 

Cvs Glv Met Glu Tyr Leu Ser Ser Arg Asn Phe He His Arg Asp 
y/ * 635 640 645 
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Leu Ala Ala Arg Asn Cys Met Leu Ala Glu Asp Met Thr Val Cys 

650 655 660 

Val Ala Asp Phe Gly Leu Ser Arg Lys He Tyr Ser Gly Asp Tyr 
5 665 670 675 

Tyr Arg Gin Gly Cys Ala. Ser Lys Leu Pro Val Lys Trp Leu Ala 

680 685 690 

10 Leu Glu Ser Leu Ala Asp Asn Leu Tyr Thr Val His Ser Asp Val 

695 700 705 

Trp Ala Phe Gly Val Thr Met Trp Glu He Met Thr Arg Gly Gin 

710 715 720 

15 

Thr Pro Tyr Ala Gly He Glu Asn Ala Glu He Tyr Asn Tyr Leu 

725 730 735 

He Gly Gly Asn Arg Leu Lys Gin Pro Pro Glu Cys Met Glu Glu 
20 " 740 745 750 

Val Tyr Asp Leu Met Tyr Gin Cys Trp Ser Ala Asp Pro Lys Gin 

755 760 765 

25 Arg -Pro Ser Phe Thr Cys Leu Arg Met Glu Leu Glu Asn He Leu 

770 775 780 

Gly His Leu Ser Val Leu Ser Thr Ser Gin Asp Pro Leu Tyr He 

785 790 795 

30 

Asn He Glu Arg Ala Glu Gin Pro Thr Glu Ser Gly Ser Pro Glu 

800 805 810 

Leu His Cys Gly Glu Arg Ser Ser Ser Glu Ala Gly Asp Gly Ser 
35 815 820 825 

Gly Val Gly Ala Val Gly Gly He Pro Ser Asp Ser Arg Tyr He 

830 835 840 

40 Phe Ser Pro Gly Gly Leu Ser Glu Ser Pro Gly Gin Leu Glu Gin 

845 850 855 

Gin Pro Glu Ser Pro Leu Asn Glu Asn Gin Arg Leu Leu Leu Leu 

860 865 870 

45 

Gin Gin Gly Leu Leu Pro His Ser Ser Cys 

875 880 

(2) INFORMATION FOR SEQ ID NO: 11: 

50 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1164 bases 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GCAGGCCTGA AGCTCATGGG CGCCCCAGTG AAGATGACCG TGTCTCAGGG 50 

L0 

GCAGCCAGTG AAGCTCAACT GCAGCGTGGA GGGGATGGAG GACCCTGACA 100 
TCCACTGGAT GAAGGATGGC ACCGTGGTCC AGAATGCAAG CCAGGTGTCC 150 
ATCTCCATCA GCGAGCACAG CTGGATTGGC TTACTCAGCC TAAAGTCAGT 200 
20 GGAGCGGTCT GATGCTGGCC TGTACTGGTG CCAGGTGAAG GATGGGGAGG 250 

AAACCAAGAT CTCTCAGTCA GTATGGCTCA CTGTCGAAGG TGTGCCATTC 300 

25 

TTCACAGTGG AACCAAAAGA TCTGGCGGTG CCACCCAATG CCCCTTTTCA 350 
GCTGTCTTGT GAGGCTGTGG GTCCTCCAGA ACCCGTAACC ATTTACTGGT 400 

30 

GGAGAGGACT CACTAAGGTT GGGGGACCTG CTCCCTCTCC CTCTGTTTTA 450 
35 AATGTGACAG GAGTGACCCA GCGCACAGAG TTTTCTTGTG AAGCCCGCAA 500 

CATAAAAGGC CTGGCCACTT CCCGACCAGC CATTGTTCGC CTTCAAGCAC 550 

40 

CGCCTGCAGC TCCTTTCAAC ACCACAGTAA CAACGATCTC CAGCTACAAC 600 
GCTAGCGTGG CCTGGGTGCC AGGTGCTGAC GGCCTAGCTC TGCTGCATTC 650 

45 

CTGTACTGTA CAGGTGGCAC ACGCCCCAGG AGAATGGGAG GCCCTTGCTG 700 



50 TTGTGGTTCC TGTGCCACCT TTTACCTGCC TGCTTCGGAA CTTGGCCCCT 750 
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GCCACCAACT ACAGCCTTAG GGTGCGCTGT GCCAATGCCT TGGGCCCTTC 800 



TCCCTACGGC GACTGGGTGC CCTTTCAGAC AAAGGGCCTA GCGCCAGCCA 850 



GAGCTCCTCA GAATTTCCAT GCCATTCGTA CCGACTCAGG CCTTATCCTG 900 



GAATGGGAAG AAGTGATTCC TGAAGACCCT GGGGAAGGCC CCCTAGGACC 950 



TTATAAGCTG TCCTGGGTCC AAGAAAATGG AACCCAGGAT GAGCTGATGG 1000 



TGGAAGGGAC CAGGGCCAAT CTGACCGACT GGGATCCCCA GAAGGACCTG 1050 



ATTTTGCGTG TGTGTGCCTC CAATGCAATT GGTGATGGGC CCTGGAGTCA 1100 



GCCACTGGTG GTGTCTTCTC ATGACCATGC AGGGAGGCAG GGCCCTCCCC Il50 



ACAGCCGCAC ATCC 1164 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 88 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Ala Gly Leu Lys Leu Met Gly Ala Pro Val Lys Met Thr Val Ser 
1 5 10 15 

Gin Gly Gin Pro Val Lys Leu Asn Cys Ser Val Glu Gly Met Glu 

20 25 30 

Asp Pro Asp lie His Trp Met Lys Asp Gly Thr Val Val Gin Asn 

35 40 45 

Ala Ser Gin Val Ser lie Ser lie Ser Glu His Ser Trp lie Gly 

50 55 60 

Leu Leu Ser Leu Lys Ser Val Glu Arg Ser Asp Ala Gly Leu Tyr 

65 70 75 
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Trp Cys Gin Val Lys Asp Gly Glu Glu Thr Lys lie Ser Gin Ser 

80 85 90 

Val Trp Leu Thr Val Glu Gly Val Pro Phe Phe Thr Val Glu Pro 

5 95 100 105 

Lys Asp Leu Ala Val Pro Pro Asn Ala Pro Phe Gin Leu Ser Cys 

110 115 120 

L0 Glu Ala Val Gly Pro Pro Glu Pro Val Thr lie Tyr Trp Trp Arg 

125 130 135 

Gly Leu Thr Lys Val Gly Gly Pro Ala Pro Ser Pro Ser Val Leu 

140 " 145 150 

15 

Asn Val Thr Gly Val Thr Gin Arg Thr Glu Phe Ser Cys Glu Ala 

155 160 165 

Arg Asn lie Lys Gly Leu Ala Thr Ser Arg Pro Ala lie Val Arg 

20 " 170 175 180 

Leu Gin Ala Pro Pro Ala Ala Pro Phe Asn Thr Thr Val Thr Thr 

185 190 ' 195 

25 lie Ser Ser Tyr Asn Ala Ser Val Ala Trp Val Pro Gly Ala Asp 

200 205 210 

Gly Leu Ala Leu Leu His Ser Cys Thr Val Gin Val Ala His Ala 

215 220 225 

30 

Pro Gly Glu Trp Glu Ala Leu Ala Val Val Val Pro Val Pro Pro 

230 235 240 

Phe Thr Cys Leu Leu Arg Asn Leu Ala Pro Ala Thr Asn Tyr Ser 

35 ~ 245 ~ 250 ' 255 

Leu Arg Val Arg Cys Ala Asn Ala Leu Gly Pro Ser Pro Tyr Gly 

260 265 270 

40 Asp Trp Val Pro Phe Gin Thr Lys Gly Leu Ala Pro Ala Arg Ala 

275 280 285 

Pro Gin Asn Phe His Ala lie Arg Thr Asp Ser Gly Leu lie Leu 

290 295 300 

45 

Glu Trp Glu Glu Val lie Pro Glu Asp Pro Gly Glu Gly Pro Leu 

305 310 315 

Gly Pro Tyr Lys Leu Ser Trp Val Gin Glu Asn Gly Thr Gin Asp 

50 " - 32Q 325 33Q 
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Glu Leu Met Val Glu Gly Thr Arg Ala As n Leu Thr Asp Trp Asp 

335 340 345 

Pro Gin Lys Asp Leu lie Leu Arg Val Cys Ala Ser Asn Ala lie 

350 " 355 360 

Gly Asp Gly Pro Trp Ser Gin Pro Leu Val Val Ser Ser His Asp 

365 ' 370 375 

His Ala Gly Arg Gin Gly Pro Pro His Ser Arg Thr Ser 

380 385 388 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGGATCCACA CGATGCGACT CTT 23 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GGAATTCCTC TCATGGAGCT AGTCCATCTC T 31 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
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CGGATCCATC CACAGAGATG T 21 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



GGAATTCCAA AGGACCAGCA CGATC 25 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



GACCGTGTGT GTGGCTGACT TTGGACTCTC CTGGAAGATC 40 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 0 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



GGCTGTGCCT CCAAATTGCC CGTCAAGTGG CTGGCCCTGG 40 



(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 50 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



AGCCGGTGAA GCTGAACTGC AGTGTGGAGG GGATGGAGGA GCCTGACATC 50 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 



TCCAGCTACA ACGCTAGCGT GGCCTGGGTG CCAGGTGCTG ACGGCCTAGC 50 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

lie His Arg Asp Leu Ala Ala Arg Asn 
15 9 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Lys Trp lie Ala lie Glu 
1 5 6 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Lys Thr Trp Thr Met Ala Pro Glu 
1 5 8 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

Lys Trp Leu Ala Leu Glu 
1 5 6 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

Lys Trp Met Ala Leu Glu 
1 5 6 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 



CAGCTGCTCG AGGCAGGTCT GAAGCTCATG 30 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27 



GCATGAATTC ATGGCACACC TTCTACCGTG 30 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 



CACTGAGCTG GCTGACTAAG 20 



(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 



CCTGATAGGC TGGGTACTCC 20 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



AAGCCCGGAC TGACCAAA 18 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



GTGCGGAATC AGAAAGATGG 20 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



TCAAGACAAT GGAACCCA 18 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 



CATGGAATTC GGTGACCGAT GTGCGGCTGT GAGGAG 36 
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(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Ala Trp Arg Cys Pro Arg Met Gly Arg Val Pro Leu Ala Trp 
1 5 10 15 

Cys Leu Ala Leu Cys Gly Trp Ala Cys Met Ala Pro Arg Gly Thr 

20 25 30 

Gin Ala Glu Glu Ser Pro Phe Val Gly Asn Pro Gly Asn lie Thr 

35 40 45 

Gly Ala Arg Gly Leu Thr Gly Thr Leu Arg Cys Gin Leu Gin Val 

50 55 60 

Gin Gly Glu Pro Pro Glu Val His Trp Leu Arg Asp Gly Gin lie 

65 70 * 75 

Leu Glu Leu Ala Asp Ser Thr Gin Thr Gin Val Pro Leu Gly Glu 

80 85 * 90 

Asp Glu Gin Asp Asp Trp lie Val Val Ser Gin Leu Arg lie Thr 

95 100 105 

Ser Leu Gin Leu Ser Asp Thr Gly Gin Tyr Gin Cys Leu Val Phe 

110 115 120 

Leu Gly His Gin Thr Phe Val Ser Gin Pro Gly Tyr Val Gly Leu 

125 130 135 

Glu Gly Leu Pro Tyr Phe Leu Glu Glu Pro Glu Asp Arg Thr Val 

140 145 150 

Ala Ala Asn Thr Pro Phe Asn Leu Ser Cys Gin Ala Gin Gly Pro 

155 160 165 

Pro Glu Pro Val Asp Leu Leu Trp Leu Gin Asp Ala Val Pro Leu 

170 175 180 

Ala Thr Ala Pro Gly His Gly Pro Gin Arg Ser Leu His Val Pro 

185 190 195 

Gly Leu Asn Lys Thr Ser Ser Phe Ser Cys Glu Ala His Asn Ala 

200 205 210 
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Lys Gly Val Thr Thr Ser Arg Thr Ala Thr lie Thr Val Leu Pro 

215 220 225 

Gin Gin Pro Arg Asn Leu His Leu Val Ser Arg Gin Pro Thr Glu 

230 235 240 

Leu Glu Val Ala Trp Thr Pro Gly Leu Ser Gly lie Tyr Pro Leu 

245 ' 250 255 

Thr His Cys Thr Leu Gin Ala Val Leu Ser Asp Asp Gly Met Gly 

260 265 270 

lie Gin Ala Gly Glu Pro Asp Pro Pro Glu Glu Pro Leu Thr Ser 

275 280 285 

Gin Ala Ser Val Pro Pro His Gin Leu Arg Leu Gly Ser Leu His 

290 295 300 

Pro His Thr Pro Tyr His lie Arg Val Ala Cys Thr Ser Ser Gin 

305 310 315 

Gly Pro Ser Ser Trp Thr His Trp Leu Pro Val Glu Thr Pro Glu 

320 ~ 325 330 

Gly Val Pro Leu Gly Pro Pro Glu Asn lie Ser Ala Thr Arg Asn 

335 340 345 

Gly Ser Gin Ala Phe Val His Trp Gin Glu Pro Arg Ala Pro Leu 

350 355 ~ 360 

Gin Gly Thr Leu Leu Gly Tyr Arg Leu Ala Tyr Gin Gly Gin Asp 

365 " 370 375 

Thr Pro Glu Val Leu Met Asp lie Gly Leu Arg Gin Glu Val Thr 

380 385 390 

Leu Glu Leu Gin Gly Asp Gly Ser Val Ser Asn Leu Thr Val Cys 

395 400 405 

Val Ala Ala Tyr Thr Ala Ala Gly Asp Gly Pro Trp Ser Leu Pro 

410 415 420 

Val Pro Leu Glu Ala Trp Arg Pro Gly Gin Ala Gin Pro Val His 

425 430 435 

Gin Leu Val Lys Glu Pro Ser Thr Pro Ala Phe Ser Trp Pro Trp 

440 445 450 

Trp Tyr Val Leu Leu Gly Ala Val Val Ala Ala Ala Cys Val Leu 

455 460 465 
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lie Leu Ala Leu Phe Leu Val His Arg Arg Lys Lys Glu Thr Arg 

470 475 480 

Tyr Gly Glu Val Phe Glu Pro Thr Val Glu Arg Gly Glu Leu Val 

485 490 495 

Val Arg Tyr Arg Val Arg Lys Ser Tyr Ser Arg Arg Thr Thr Glu 

500 • * 505 ~ 510 

Ala Thr Leu Asn Ser Leu Gly lie Ser Glu Glu Leu Lys Glu Lys 

515 520 525 

Leu Arg Asp Val Met Val Asp Arg His Lys Val Ala Leu Gly Lys 

530 535 540 

Thr Leu Gly Glu Gly Glu Phe Gly Ala Val Met Glu Gly Gin Leu 

545 550 555 

Asn Gin Asp Asp Ser lie Leu Lys Val Ala Val Lys Thr Met Lys 

560 565 570 

lie Ala He Cys Thr Arg Ser Glu Leu Glu Asp Phe Leu Ser Glu 

575 580 585 

Ala Val Cys Met Lys Glu Phe Asp His Pro Asn Val Met Arg Leu 

590 595 600 

He Gly Val Cys Phe Gin Gly Ser Glu Arg Glu Ser Phe Pro Ala 

605 610 615 

Pro Val Val He Leu Pro Phe Met Lys His Gly Asp Leu His Ser 

620 625 630 

Phe Leu Leu Tyr Ser Arg Leu Gly Asp Gin Pro Val Tyr Leu Pro 

635 " 640 * 645 

Thr Gin Met Leu Val Lys Phe Met Ala Asp He Ala Ser Gly Met 

650 655 660 

Glu Tyr Leu Ser Thr Lys Arg Phe He His Arg Asp Leu Ala Ala 

665 670 ~ 675 

Arg Asn Cys Met Leu Asn Glu Asn Met Ser Val Cys Val Ala Asp 

680 685 690 

Phe Gly Leu Ser Lys Lys He Tyr Asn Gly Asp Tyr Tyr Arg Gin 

695 700 705 

Gly Arg He Ala Lys Met Pro Val Lys Trp He Ala He Glu Ser 

710 715 720 
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Leu Ala Asp Arg Val Tyr Thr Ser Lys Ser Asp Val Trp Ser Phe 

725 730 * 735 

Gly Val Thr Met Trp Glu He Ala Thr Arg Gly Gin Thr Pro Tyr 

740 745 750 

Pro Gly Val Glu Asn Ser Glu He Tyr Asp Tyr Leu Arg Gin Gly 

755 . 760 * 765 

Asn Arg Leu Lys Gin Pro Ala Asp Cys Leu Asp Gly Leu Tyr Ala 

770 775 " * * 780 

Leu Met Ser Arg Cys Trp Glu Leu Asn Pro Gin Asp Arg Pro Ser 

785 790 ~ 795 

Phe Thr Glu Leu Arg Glu Asp Leu Glu Asn Thr Leu Lys Ala Leu 

800 805 810 

Pro Pro Ala Gin Glu Pro Asp Glu He Leu Tyr Val Asn Met Asp 

815 820 825 

Glu Gly Gly Gly Tyr Pro Glu Pro Pro Gly Ala Ala Gly Gly Ala 

830 835 840 

Asp Pro Pro Thr Gin Pro Asp Pro Lys Asp Ser Cys Ser Cys Leu 

845 850 ' 855 

Thr Ala Ala Glu Val His Pro Ala Gly Arg Tyr Val Leu Cys Pro 

860 865 870 

Ser Thr Thr Pro Ser Pro Ala Gin Pro Ala Asp Arg Gly Ser Pro 

875 880 ~ * 885 

Ala Ala Pro Gly Gin Glu Asp Gly Ala 

890 " "* 894 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Gly Arg Val Pro Leu Ala Trp Trp Leu Ala Leu Cys Cys Trp 
1 5 10 ' - 15 

Gly Cys Ala Ala His Lys Asp Thr Gin Thr Glu Ala Gly Ser Pro 

20 25 30 
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Phe Val Gly Asn Pro Gly Asn lie Thr Gly Ala Arg Gly Leu Thr 

35 40 45 

Gly Thr Leu Arg Cys Glu Leu Gin Val Gin Gly Glu Pro Pro Glu 

50 55 60 

Val Val Trp Leu Arg Asp Gly Gin lie Leu Glu Leu Ala Asp Asn 

65 70 75 

Thr Gin Thr Gin Val Pro Leu Gly Glu Asp Trp Gin Asp Glu Trp 

80 85 90 

Lys Val Val Ser Gin Leu Arg lie Ser Ala Leu Gin Leu Ser Asp 

95 100 105 

Ala Gly Glu Tyr Gin Cys Met Val His Leu Glu Gly Arg Thr Phe 

110 115 120 

Val Ser Gin Pro Gly Phe Val Gly Leu Glu Gly Leu Pro Tyr Phe 

125 130 135 

Leu Glu Glu Pro Glu Asp Lys Ala Val Pro Ala Asn Thr Pro Phe 

140 145 150 

Asn Leu Ser Cys Gin Ala Gin Gly Pro Pro Glu Pro Val Thr Leu 

155 160 165 

Leu Trp Leu Gin Asp Ala Val Pro Leu Ala Pro Val Thr Gly His 

170 175 180 

Ser Ser Gin His Ser Leu Gin Thr Pro Gly Leu Asn Lys Thr Ser 

185 190 * 195 

Ser Phe Ser Cys Glu Ala His Asn Ala Lys Gly Val Thr Thr Ser 

200 205 210 

Arg Thr Ala Thr lie Thr Val Leu Pro Gin Arg Pro His His Leu 

215 220 225 

His Val Val Ser Arg Gin Pro Thr Glu Leu Glu Val Ala Trp Thr 

230 235 240 

Pro Gly Leu Ser Gly lie Tyr Pro Leu Thr His Cys Asn Leu Gin 

245 250 255 

Ala Val Leu Ser Asp Asp Gly Val Gly lie Trp Leu Gly Lys Ser 

260 265 270 

Asp Pro Pro Glu Asp Pro Leu Thr Leu Gin Val Ser Val Pro Pro 

275 280 285 
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His Gin Leu Arg Leu Glu Lys Leu Leu Pro His Thr Pro Tyr His 

290 295 * 300 

lie Arg lie Ser Cys Ser Ser Ser Gin Gly Pro Ser Pro Trp Thr 

305 310 315 

His Trp Leu Pro Val Glu Thr Thr Glu Gly Val Pro Leu Gly Pro 

320 325 330 

Pro Glu Asn Val Ser Ala Met Arg Asn Gly Ser Gin Val Leu Val 

335 340 345 

Arg Trp Gin Glu Pro Arg Val Pro Leu Gin Gly Thr Leu Leu Gly 

350 355 360 

Tyr Arg Leu Ala Tyr Arg Gly Gin Asp Thr Pro Glu Val Leu Met 

365 370 375 

Asp lie Gly Leu Thr Arg Glu Val Thr Leu Glu Leu Arg Gly Asp 

380 385 " 390 

Arg Pro Val Ala Asn Leu Thr Val Ser Val Thr Ala Tyr Thr Ser 

395 400 405 

Ala Gly Asp Gly Pro Trp Ser Leu Pro Val Pro Leu Glu Pro Trp 

410 415 420 

Arg Pro Gly Gin Gly Gin Pro Leu His His Leu Val Ser Glu Pro 

425 430 435 

Pro Pro Arg Ala Phe Ser Trp Pro Trp Trp Tyr Val Leu Leu Gly 

440 445 * 450 

Ala Leu Val Ala Ala Ala Cys Val Leu lie Leu Ala Leu Phe Leu 

455 460 465 

Val His Arg Arg Lys Lys Glu Thr Arg Tyr Gly Glu Val Phe Glu 

470 475 " 480 

Pro Thr Val Glu Arg Gly Glu Leu Val Val Arg Tyr Arg Val Arg 

485 490 ~ " ~ 495 

Lys Ser Tyr Ser Arg Arg Thr Thr Glu Ala Thr Leu Asn Ser Leu 

500 505 510 

Gly lie Ser Glu Glu Leu Lys Glu Lys Leu Arg Asp Val Met Val 

515 520 " 525 

Asp Arg His Lys Val Ala Leu Gly Lys Thr Leu Gly Glu Gly Glu 
!0 530 535 540 



:5 



144 



Phe Gly Ala Val Met Glu Gly Gin Leu Asn Gin Asp Asp Ser lie 

545 550 555 

Leu Lys Val Ala Val Lys Thr Met Lys lie Ala lie Cys Thr Arg 

560 565 570 

Ser Glu Leu Glu Asp Phe Leu Ser Glu Ala Val Cys Met Lys Glu 

575 ' 580 585 

Phe Asp His Pro Asn Val Met Arg Leu lie Gly Val Cys Phe Gin 

590 ~ 595 600 

Gly Ser Asp Arg Glu Gly Phe Pro Glu Pro Val Val lie Leu Pro 

605 610 615 

Phe Met Lys His Gly Asp Leu His Ser Phe Leu Leu Tyr Ser Arg 

620 625 630 

Leu Gly Asp Gin Pro Val Phe Leu Pro Thr Gin Met Leu Val Lys 

635 640 645 

Phe Met Ala Asp lie Ala Ser Gly Met Glu Tyr Leu Ser Thr Lys 

650 655 ' 660 

Arg Phe lie His Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Asn 

665 670 675 

Glu Asn Met Ser Val Cys Val Ala Asp Phe Gly Leu Ser Lys Lys 

680 685 690 

lie Tyr Asn Gly Asp Tyr Tyr Arg Gin Gly Arg lie Ala Lys Met 

695 ~ 700 * 705 

Pro Val Lys Trp lie Ala lie Glu Ser Leu Ala Asp Arg Val Tyr 

710 715 " 720 

Thr Ser Lys Ser Asp Val Trp Ser Phe Gly Val Thr Met Trp Glu 

725 730 " 735 

lie Ala Thr Arg Gly Gin Thr Pro Tyr Pro Gly Val Glu Asn Ser 

740 745 750 

Glu lie Tyr Asp Tyr Leu Arg Gin Gly Asn Arg Leu Lys Gin Pro 

755 760 765 

Val Asp Phe Leu Asp Gly Leu Tyr Ser Leu Met Ser Arg Cys Trp 

770 775 " * 780 

Glu Leu Asn Pro Arg Asp Arg Pro Ser Phe Ala Glu Leu Arg Glu 

785 790 795 
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Asp Leu Glu Asn Thr Leu Lys 

800 

Asp Glu lie Leu Tyr Val Asn 

815 

Glu Pro Arg Gly Ala Ala Gly 

830 

Asp Pro Lys Asp Ser Cys Ser 

845 

Ser Ala Gly Arg Tyr Val Leu 

860 

Thr Leu Ser Ala Asp Arg Gly 

875 

Asp Gly Ala 
888 



Ala Leu Pro Pro Ala Gin 
805 

Met Asp Glu Gly Gly Ser 
820 

Gly Ala Asp Pro Pro Thr 
835 

Cys Leu Thr Ala Ala Asp 
850 

Cys Pro Ser Thr Ala Pro 
865 

Cys Pro Ala Pro Pro Gly 
880 



Glu Pro 
810 

His Leu 
825 

Gin Pro 
840 

Val His 
855 

Gly Pro 
870 

Gin Glu 
885 
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